Visual Cluster Analysis in Data Mining

نویسندگان

  • Ke-Bing Zhang
  • Zhang Xiao-lin
  • Mehmet A. Orgun
چکیده

Clustering is a major technique in data mining. However the numeri-cal feedback of clustering algorithms is difficult for user to have an intuitiveoverview of the dataset that they deal with. Visualization has been proven to bevery helpful for high-dimensional data analysis. Therefore it is desirable to in-troduce visualization techniques with user’s domain knowledge into clusteringprocess. Whereas most existing visualization techniques used in clustering areexploration oriented. Inevitably, they are mainly stochastic and subjective in na-ture. In this paper, we introduce an approach called HOV (Hypothesis OrientedVerification and Validation by Visualization), which projects high-dimensionaldata on the 2D space and reflects data distribution based on user hypotheses. Inaddition, HOV enables user to adjust hypotheses iteratively in order to obtainan optimized view. As a result, HOV provides user an efficient and effectivevisualization method to explore cluster information.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Geo-visualization Support for Multidimensional Clustering

In this paper we consider how multidimensional clustering can be complemented by interactive visualization. We propose a link between geovisualization and data mining systems for supporting an iterative analysis cycle, including data pre-processing and visual exploration, automatic detection of clusters in multidimensional space of user-selected attributes, and visual analysis of cluster analys...

متن کامل

Methods for the Visualization of Clustered Climate Data

Increasing amounts of large climate data require new analysis techniques. The area of data mining investigates new paradigms and methods including factors like scalability, flexibility and problem abstraction for large data sets. The field of visual data mining in particular offers valuable methods for analyzing large amounts of data intuitively. In this paper we describe our approach of integr...

متن کامل

Data Mining

Data Mining provides approaches for the identification and discovery of non-trivial patterns and models hidden in large collections of data. In the applied natural language processing domain, data mining usually requires preprocessed data that has been extracted from textual documents. Additionally, this data is often integrated with other data sources. This chapter provides an overview on data...

متن کامل

Steerable Clustering for Visual Analysis of Ecosystems

One of the great challenges in the geosciences is understanding ecological systems in order to predict changes and responses in space and time at scales from local to global. Ecologists are starting to recognize the value of analysis methods that go beyond statistics to include data mining, visual representations, and combinations of these in computational tools. However, the tools in use today...

متن کامل

A hybrid Algorithm for Epidemic Disease Prediction with Multi Dimensional Data

Data mining is has three major components Clustering or Classification, Association Rules and Sequence Analysis. The clustering techniques analyze a set of data and generate a set of grouping rules that can be used to classify future data. The mining tool automatically identifies the clusters, by studying the pattern in the training data. Once the clusters are generated, classification can be u...

متن کامل

An Empirical Study on the Visual Cluster Validation Method with Fastmap

This paper presents an empirical study on the visual method for cluster validation based on the Fastmap projection. The visual cluster validation method attempts to tackle two clustering problems in data mining: ( I ) to veri f y partitions of data created by a clustering algorithm and ( 2 ) to identify genuine clusters from data partitions. They are achieved through projecting objects and clus...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007